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1. Introduction 

In 1958, while working as an assistant to Professor N. Obreshkov, Blagovest Sendov 
raised the following question. Assume that all zeros of a polynomial F with complex 
coefficients lie in the closed unit disk. Is it true that there exists a critical point in 
every disk of radius one centered at a zero of F? The conjecture was included in 
the 1967 book of research problems collected and edited by W. Hayman [31 . The 
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conjecture was wrongly attributed to Ilieff, who in fact only had communicated 
the problem to the world of mathematicians. 

Sendov's conjecture naturally fits into the line of classical works on the geom- 
etry of polynomials and at the same time it has close connections with potential 
theory and operator theory, allowing non-trivial matrix theory reformulations. In 
spite of many ingenious ideas originating in attempts to solve Sendov's conjec- 
ture, only partial results are known today, see for instance the respective chapters 
in the monographs [39l |47l [51] . It was the young Julius Borcea who pushed the 
positive solution to this problem up to degree seven [8] , and it was the mature 
Julius Borcea who outlined a series of extensions and modifications of Sendov's 
conjecture in the context of statistics of point charges in the plane. The authors 
of the present note had the privilege to work closely with Borcea on this topic, 
individually, and on two occasions in full formation (generously supported by the 
Mathematics Institute at the Banff Center in Canada, and by the American Insti- 
tute of Mathematics in Palo Alto). Without aiming at completeness, the following 
pages offer a glimpse at Borcea's beautiful new ideas. 

To understand the present status of Sendov's conjecture we outline first the 
main contributions scattered through its half century history. In the late 1960s 
soon after the publication of Hayman's book, a series of papers by several au- 
thors proved Sendov's conjecture for polynomials of degree three, four and five in 
quick succession. Sendov's conjecture for polynomials of degree six turned out to 
be significantly more difficult; its solution in Julius Borcea's first research paper 
appeared more than a quarter century later in 1996 [7 . In his next paper pub- 
lished in the same year, he proved that the conjecture is true for polynomials of 
degree seven [8]. It is now known that Sendov's conjecture holds for polynomials 
with at most eight distinct zeros [T7]. In the twenty- five years between the proofs 
for polynomials of degree five and those of degree six, some other special cases 
of Sendov's conjecture of were solved; these tended to be classes of polynomials 
defined by conditions on the geometry of their zeros rather than on their degree. 
For instance, Sendov's conjecture is true in the following cases: 

- all zeros are real [47], 

- for all trinomials [52], 

- all zeros belong to the unit circle [29] . 

- all zeros are contained in the closed unit disk, and at least one lies at the 
center of the disk [5] , 

- if the convex hull of the zeros form a triangle |53j . 

The contents are the following. We introduce Borcea's family of conjectures 
in the next section. We verify the p = 2 version of Borcea's conjecture for poly- 
nomials with at most three distinct roots in section three and the even stronger 
p — 1 version of Borcea's conjecture for cubic polynomials in section four. The 
connection between these family of conjectures and the theory of univalent func- 
tions is explored in section five. The next three sections detail various operator 
theoretical approaches to these conjectures and other results in the analytic theory 
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of polynomials. Using the logarithmic derivative, it is easy to see that the zeros of 
the derivative of a polynomial can be viewed as the zeros of the Cauchy transform 
of a sum of point measures at the zeros of the polynomial. In section nine, we 
generalize Borcea's conjectures to the zeros of Cauchy transform of weighted point 
measures. Our concluding section is somewhat more speculative; we look at some 
very interesting ideas in the mathematical literature which could prove useful for 
tackling these questions. 



2. The p- variance conjectures 

Since we are interested in the geometry of zeros of a polynomial, it is natural to 
introduce some quantitative estimates of their relative position in the plane. We 
denote by V(F) = {z E C : F(z) = 0} the zero set of a polynomial F £ C[z]. 

Definition 2.1. Let F(z) = allfc=i( z — z k) be a polynomial and let p € (0,oo). 
Define the p-variance of the zero set of F by o~ p (F) = min ce c(i XX=i \ z k ~ c\ p )p . 

Certain values of cr p (F) will be of particular importance to us: (72(F) is the 
variance and o~\(F) is called the mean deviation. We define a 00 (F) as: 

g 00(F) = min max \zu — c\ 

c l<k<n 

and we identify it as the Chebyshev radius of the set V(F); the optimal value of c 
is known as the Chebyshev center of the set V(F), see for instance [16] or ample 
references in the Banach space literature [48] . 

Lemma 2.1. For every polynomial F and < p < q < 00 we have 

^ P (F)<a q (F)<a 00 (F). 

Proof. Let c be an optimal center for o~ q : 



1 71 

^(F) = (-El^- c H i 



n 
fc=l 



In virtue of Holder's inequality: 

\ z * " C \ P ^ E |zfe - c\ p ^T /q n 1 ' p/q , 

k k 

hence 

a p (F) <[±J2 \** ~ C H 1/P ^ £ E I** ~ c H 1/? = °^ F ) 

k k 

and the first inequality follows. 

The second inequality is obtained similarly, starting with an optimal center 
c with respect to o-qo. □ 
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Recall that in the case of variance 02 the optimal center is the barycenter of 
the zeros of F 

1 - 

E(F) = -^z k . 
fc=l 

Indeed, the reader can verify with basic inner space techniques that 

n n 

min^|z fc -c| 2 = ^|z fc -£;(^)| 2 . 

k=l k=l 

Note also that in the above definitions we allow multiple roots, so that 
a p (F), < p < 00, really depends on the polynomial F rather than its zero 
set V(F), although in the literature one also finds the notation a p (V(F)). Only 
<7oo(F) is intrinsic to V(F). 

We define h(F, F') to be the unsymmetrizcd Hausdorff distance between the 
zero sets V(F) and V(F'): 

h(F,F'):— max min \z — w\, 

F(z)=0 F'(w)=0 

while H(F,F') :— m&x(h(F, F'), h(F' , F)) is the symmetrized Hausdorff distance 
between the the two sets. Alternatively, we will use the more suggestive notation 

H(V(F),V(F'))=H(F,F'), 

and all related variations. 

A series of conjectures, derived by Borcea from the Ilieff-Scndov original 
problem, are stated below. 

Conjecture 1 (Borcea variance conjectures). Let F be a polynomial of degree at 
least two and let p € [I, 00). Then h(F, F') ^ o~ p (F). 

When p = 00, this is Sendov's original conjecture. In view of the above 
lemma, the validity of Borcea's conjecture for a specific value of p implies that 
the inequality holds for all q > p, including q = 00. We will see shortly that the 
conjecture is false for p < 1. The problem regarding the extremal configurations 
for Sendov's Conjecture was raised by Phelps and Rodrigues [45] as follows. 

Conjecture 2. Ifp > 1, then equality in the above conjecture occurs if and only if F 
is of the form a(z — c) n — b for some complex numbers a, b, c, whenever deg F < n. 

Example 1. Let n > 2 and F(z) = z(z - I)™" 1 . Then F'(z) = (z - l) n - 2 (nz - I), 
so that V{F) = {0, 1} and V(F') = {i, I}. We derive from here that 



h{F,F') = -, a ao (F) = l, 



and 



°v(F) = ~T7p min c l P + (n - 1)|1 - = -tjz min K + (n - 1)(1 - c)Y /p - 

n L / p c n L ' p ce[0,i] 

lip= 1, then <Ji(F) = — min ce [ 0j i] [n— 1 — c(n — 2)] = —, therefore Conjecture 
[Udoes not hold for p = 1, while a p (F) > (Ti(F) = h(F,F')\or all p > 1. 
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Assume that p < 1. The only critical point in the interval c £ [0, 1] of the 
function (p(c) — c p + (n — 1)(1 — c) p is c = 1+ ^ t _ 1 1 ^ 1/(p _ 1) , with <p(c) increasing on 
the interval [0, cq] and decreasing on [cq, 1]. Thus 

showing that Conjecture Q] cannot be extended to the case p < 1. 

Another question which may be of interest is whether asymptotic versions of 
conjecture [TJ can be proven. For instance: 

Conjecture 3. For all n ^ 2,p > 1, there exists C ritP ^ 1 with limn^oo C ntP = 1 
such that h{F,F') ^ C n ^ p o~ p {F) provided deg(F) < n. 

i 

The case p = oo was validated in [5] by the value C niOQ = 2~ . 
To a polynomial F(z) = I1a;=i( z— z k) with derivative F'(z) — n nj=i( z — w j) 
we associate the discriminant: 

Discr(F) J]^ ~ ^ 

fc>f 

and the resultant: 

n n— 1 

r(f,f') : =nn^-o 

fc=i i=i 

The fundamental relation between the two is: 

|Discr(F)| =n n \R(F,F')\. 

The proof of this identity follows from the observation that F'(zj) — Y[i^j( z j ~~ 

z i) = n Ilt=i 1 ( z j ~w k ). 

When all points are in the closed unit disk, the maximum value of the dis- 
criminant is attained for z n — 1, for which we get Discr(z" — 1) = n n (equivalently, 
equally spaced points are Fekete points for the disk) . Thus we immediately obtain 
that if all the zeros of the polynomial F are in the unit disk, then the zeros Wj 
of F' must satisfy 

n—1 n n—1 

3=1 fc=l j=l 

Thus we infciQ that Sendov's conjecture is true for at least one zero of F and 
a little more: there exists a zero of F with the property that the product of all 
distances to the critical points of F is less than or equal to 1 . 

The rest of the present article focuses on Conjecture Q] in the two most im- 
portant cases p — 1 and p = 2. 



1 This also follows easily from the Gauss-Lucas Theorem and the geometric fact that any point 
of V(-F') that is further than rTco(F) from V(F) cannot be in the convex hull of V(F). See also 
Proposition 4.1. 
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3. Gauss-Lucas matrices 

In order to prove the variance conjecture for polynomials with at most three dis- 
tinct zeros and for polynomials with all zeros real we need the following definition. 

Definition 3.1. Let F(z) = Uj=i( 

z — Zj) mj where all the Zj are distinct. Let 
Wi, w%, Wk—i be the zeros of F' that are not zeros of F, counted with multi- 
plicities. We define the Gauss-Lucas matrix to be the [k — 1) X k matrix G with 
entries 

_ rrij\wi - Zj\~ 2 

9ij _ Y^ I 1-2' 

L«=l m a\Wi - Z a \ 1 

We note that the Gauss-Lucas matrix is a stochastic matrix which maps the 
vector of zeros of the original polynomial into the vector of zeros of its derivative: 
specifically, G has only non-negative entries, the sum of all elements of any row of 
G is equal to 1, and w — Gz (where w = {w\, Wk~i) T and z = (z%, Zk) T are 
column vectors). Indeed, one starts with the identity 

Q = F'(Wj) = Wij = m 2 {w l - Zj) 



F{wi) 



which yields 



fc fc 

a=l j=l 



In particular we notice that every Wi is a convex combination of the z'jS. 

Apparently it was Cesaro who first wrote the above system of equations 
relating the zeros of a polynomial to its critical points |18j . Thus we have proved 
the following classical result. 

Theorem 3.1 (Gauss-Lucas). The critical points of a polynomial lie in the convex 
hull of its zeros. 

Definition 3.2. Let F(z) = T\j=i( z ~ z i) m3 wh 

ere all Zj are distinct. We define 
the augmented Gauss-Lucas matrix A of F to be the k x k matrix whose first k — 1 
rows consist of the Gauss-Lucas matrix G and whose last row is defined as follows 

1714 

a kj := -f. 

Since every row or column vector of a unitary matrix has Euclidean norm 
equal to one, we have the following well known result, see for instance [351 section 
6.3]. 

Lemma 3.1. Let M — [my] be any n x n matrix and define $(M) to be the n x n 
non-negative matrix whose (i,j)-th entry is |m.jj| 2 . If U is a unitary matrix, then 
<&([/) is doubly stochastic, i.e. all entries of <&(U) are non-negative and that the 
sums along every column or row equal 1. 
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We are now able to prove the key technical result in the case of three distinct 
zeros. The case of two distinct zeros can be reduced via a rotation to the case of 
real zeros, to be discussed in full below. 

Lemma 3.2. Any polynomial F with at most three distinct zeros has an augmented 
Gauss-Lucas matrix which is doubly stochastic. 

Proof. Let z%, z 2 and Z3 be the zeros of the polynomial F with multiplicites mi,m 2 
and 771,3 respectively. Let 

Jrh~i JmTi Jrh~i T Jm{ Jrrvi Jmi 

"1 = (— , — , — ) , W2 = (— , — , — ) 

w\ — z\ Wx — z 2 wi — Z3 w 2 - zi w 2 - z 2 w 2 - z 3 

(note that we take the transpose for v\] but we take the conjugate transpose for 
v 2 ) and V3 = (y/mi, y/m 2 , \frh~s)- These three vectors are mutually orthogonal and 
therefore {z)i} 3 =1 is an orthonormal basis of C 3 , where Vi — jj^jjy- Let U be the 
unitary matrix whose rows are Then the augmented Gauss-Lucas matrix 

is $>(U) which is doubly stochastic. □ 

We note that the fourth degree polynomial F(z) — z 4 — 3z 2 — 4 has an 
augmented Gauss-Lucas matrix which is not doubly stochastic so the above lemma 
does not generalize to higher degree polynomials. However, we are more fortunate 
in the case of purely real zeros. 

Lemma 3.3. Any polynomial F all of whose zeros are real has an augmented Gauss- 
Lucas Matrix which is doubly stochastic. 

Proof. Let {zj}JL 1 be the zeros of the polynomial F, rrij be the multiplicity of 
Zj and let {^j}™^ 1 be the zeros of F' which aren't also zeros of F. Let vj — 
( v/ " r *~ , v/ "^ , .... v/ "" ) T when j ^ k— 1 and let Vk = (Jmi, \/m 2 , Jwik). 

Then {vj}j =1 is an orthogonal set. Now, the same argument as in the proof of 
Lemma 3.2 yields the result. □ 

Remark that the vectors Vj have the following operator theoretic interpre- 
tation. For simplicity we assume no multiple zeros. Let D = diag(2i, Z2, z n ). 
Let P be the projection on to the orthogonal complement of the vector w — 
77=(1, 1, 1)- Let B = PDF . According to [44], the eigenvalues of B are 
w\ 1 w 2 , w n —i, 0. It can be seen that Vj is an eigenvector of B corresponding to 
the eigenvalue Wj and v n is an eigenvector of B corresponding to the eigenvalue 
0. If A is Hermitian, then so is B, therefore the vectors Vk are orthogonal. If A is 
not a linear function of a Hermitian matrix, then B is not normal by a Theorem 
of Fan and Pall [53] (this is also proved directly in [S]). We will return to this 
point of view in a subsequent section. 

Theorem 3.2. Any polynomial F with at most three distinct zeros satisfies Borcea's 
2-variance conjecture. 
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Proof. Let Zj be a zero of F. We prove that there is a zero of F' which lies in the 
disk centered at z of radius &i (F). If Zj is a multiple zero then we are done, so let us 
assume that Zj has multiplicity one. Then by Lemma |3.2[ E-t=i 9ij = !1 ir~' nence 
there is at least one % : 1 < i ^ 2 such that gij ^ \. Let E = E(F) = Efc=i I ir z k- 
Then 

1 ^ Wi - z k 1 ^ E - Zk ,1 Wi - E 1^ E - z k 

1 = - > ma = - > TOfe h- > mfc = - > TOfc . 

n f-J Wi - z fc n f— ' Wi- z k n f-J ^ - z fe n f— Wi - z k 

k—l k=l k—1 k=l 

Thus, by Cauchy-Schwarz inequality and in view of our choice of the index i, we 
obtain that: 

3 3 

1 < {^^m k \E ~ z k \ 2 ){^2 mk \ Wl - z k \- 2 ) < l^-^l-VaCF) 2 . (3.1) 
fc=i fc=i 

Retaining the extreme terms in the inequality we obtain \wi — Zj\ ^ o~2{F). 

Remark that equality would require that no entry in the jth column of the 
Gauss-Lucas matrix be strictly larger than i which would force n to be equal 
to three and all of the Zi, z 2 and z 3 to be equidistant from it)j. By examining 
the coefficient of the polynomial G(z) = F(z ~ Wi) one can show that F(z) = 
a(z — c) 3 — b for some complex numbers a, &, c. □ 

Furthermore, as we saw, the above Lemma implies that Borcea's variance 
conjecture holds for polynomials with real zeros. 

We note that we do not need the augmented Gauss-Lucas matrix to be dou- 
bly stochastic for the above proof to work; any nth degree polynomial which has a 
Gauss-Lucas matrix in which every column has an element greater than or equal 
to — would satisfy Borcea's 2- variance conjecture. Unfortunately, even this weaker 
conjecture is not true in general as it fails for the 19th degree polynomial found 
in g3]. This polynomial (after translation) is F(z) = z 19 - 0.881444934z 18 + 
0.896690269z 17 - 0.492806889. The two columns of its Gauss-Lucas matrix corre- 
sponding to the conjugate pair 0.909090818 ± 0.330014556i of zeros of F have all 
entries strictly less than 

It would be interesting to understand what other conditions besides stochas- 
ticity Gauss-Lucas matrix must satisfy. For instance, what can be said about its 
column sums? What about the size of the maximal elements in each column? 



4. The mean deviation conjecture 

We begin with a brief review of apolarity, a key concept in the geometry of poly- 
nomials, see for instance [551 Wf\ I51j . 

Definition 4.1. Let F(z) = ELo (l) a k zk and G ( z ) = ELo (l) b k zk - Then F ( z ) 
and G(z) are said to be apolar provided that Efc=o( — {j^) a kb n -k = 0. 
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Theorem 4.1 (Grace's Theorem, 1902). If F and G are apolar polynomials then 
any closed circular domain which contains all zeros of G contains at least one zero 
ofF. 

There are many equivalent forms of Grace's Theorem [30] proved by among 
others Szego and Walsh (see section 3.4 of [37] for details). At the time of writing, 
the most recent new proof of Grace's Theorem is by Borcea and Branden and can 
be found in |14j . Grace's Theorem and apolarity are used to prove many results 
in the analytic theory of polynomials, including many of the known special cases 
of Sendov's conjecture. A striking example of an application of apolarity to this 
conjecture is Borcea's necessary and sufficient conditions for Sendov's conjecture 
in [5] 

We first note that we can prove a mean deviation result for the reverse di- 
rection of the unsymmetrized Hausdorff distance. 

Proposition 4.1. Let F be a polynomial of degree at least two. Then h(F',F) ^ 
at(F). 

Proof. Let F(z) = Y[k=i( z ~ z fc)> where the Zk are not required to be distinct. Let 
w be a zero of F' that is not also a zero of F. Now pick c such that i Y^k=i \ z k~ c \ — 
<ri(F), since £Li = (c - = 0, we have 

1 = - > < cri(F) max 1 

n — ' W — Zfe l^fc^n \W — Zk\ 

□ 

Hence we can show for every p > 1 that H(F, F') ^ <J P {F) would follow from 
h(F,F')^a p (F). 

The mean deviation conjecture {p — 1) has been verified for cubic polynomials 
by J. Borcea. In the remainder of this section we discuss his proof [13J. 
First we define the circular deviation. Let 

n 

a, = {(Ci,6,...,Cn) e C n : |Ci| = |C 2 | - ... - Knl = 1 : = 0}. 

fc=l 

For F{z) = al\l =1 (z - z k ), set 

1 - 

CTcirc(^) = SUp{-| VC fc Zi| : (Cl,C2,-,Cn) € 

n * — ' 

fc=i 

The sets £l n for n — 2,3,4 can be described in simple geometric terms. 
Specifically, denote ( = (Ci , C2 , ■■■Xn) and £ = (£1,^2, • ••,£«); we say that £ and £ 
are equivalent if there exists c € C with |c| = 1 and a permutation r G S n such 
that Cfc = c£ t qa. Then ^2 is the set of all vectors equivalent to (1,-1); O3 is the set 
of all vectors equivalent to (1,uj,uj 2 ) where a; is a primitive cube root of unity; Q4 
is the set of all vectors equivalent to (1, —1, c, — c) where c is any complex number 
of modulus one. 
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It can easily be shown that er C irc (F) $J o\ (F) . The inequality can be however 
strict. For instance, in the case of F(z) — z 3 — z we have <T c i rc (i 7 ') = ^= and 

a 1 (F) = l 

Note that if F is a cubic polynomial and is one of its zeros, one may 
construct a 3 x 3 circulant matrix 

(—a — b a 
b -a-b 
a b 

whose characteristic polynomial is F. For the theory of circulant matrices we 
refer to the monograph [15]. Then any two by two principal submatrix of C has 
characteristic polynomial ^F'(z) = (z + a + b) 2 — ab. By remarking that the 
circulant matrix can be diagonalized by the Fourier matrix 



w 3 = l,w^l, 



one proves that <r c i rc (F) — max(|a|, Then one verifies that F'(z) is apolar to 
G{z) — (z — a)(z — b). Since the disk centred at zero of radius cr c i rc (F) contains 
both zeros of G, it must contain at least one of the zeros of F'. 




5. Univalence criteria 

A classical counterpart to Grace Theorem was discovered in 1915 by J. W. Alexan- 
der [1] and refined in 1917 by Kakeya [35] (see also [5TJ sec. 5.7]), as follows. 

Theorem 5.1 ( Alexander-Kakeya) . If a polynomial F of degree n has no critical 
points in a closed disk of radius R, then it is univalent in the concentric disk of 
radius J2sin(^). 

The extremal polynomials for the Alexander-Kakeya Theorem (the ones for 
which iisin(— ) cannot be improved) are the same ones as in Conjecture [5] a(z — 
c) n — b. We are naturally led now to formulate the following conjecture. 

Conjecture 4. If F is an nth degree polynomial with F(0) = 0, then F is not 
univalent in any closed disk of radius larger than c p (F) sin(— ), p > 1, centered at 
zero. 

Of course, if the conjecture is true for p = 1, then it follows for all p > 1. 
By the Alexander-Kakeya Theorem, the truth of this conjecture implies the truth 
of Conjecture [1] for the same value of p. It can be easily seen that the strongest 
case p = 1 of this conjecture is true for n = 2. In this case F(z) = z 2 — az and 
o~\{F) = \a\/2. Since F has a critical point at a/2, it is clearly not univalent in 
any disk larger than |a|/2, and so our conjecture holds. 
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This nonunivalence conjecture opens the possibility to use the huge body of 
known facts about the univalence of polynomials and analytic functions. Tradition- 
ally one normalizes the functions by the conditions F(0) — and -F'(O) = 1. In our 
polynomial case we simply consider F n (z) — zllfc=i(l — z k lz ) = z + ■ ■ ■ + a n z n . 
If such a polynomial is univalent in the open unit disk, then it is well-known, for 
example, that \a n \ < 1/n. For a survey of results on univalent polynomials, see 

m 



6. Perturbations of normal matrices 

We present below a different approach to Borcea's 2-variance conjecture derived 
from some recent matrix theory advances of Pereira [44] and Malamud [55] . We 
return to a monic polynomial 

F{z) = (z- Zl){ Z -Z2)...( Z - z n ), (6.1) 

and the variance of its zero set V(F) = {z\, . . . , z n } where we repeat the enumer- 
ation of multiple zeros. 

n 1 n 

fc=i fe=i 

where E — E(F) is the barycenter: 

1 " 
E=-Y j z k . 

k=l 

If now 

F'(z)=n(z-w 1 )...(z-w n - 1 ) (6.2) 

and V(F') is the sequence of critical points {uu\, . . . , it) n _i}, then recall that the 
2-variance Conjecture Q] states that 

H(V(F),V(F'))^a 2 (F), 

where H(-,-) is the symmetrized Hausdorff distance. 

The main idea is to associate with the polynomial (|6.ip a normal operator 
A acting in C" so that A is diagonal with entries z\, . . . , z n along the diagonal. 
In this case Fa(z) = F(z), where Fa(z) is the characteristic polynomial of A. 
According to the terminology introduced in [33], a unit vector v 6 C" is called a 
differentiator if 

-F / (z)=F B (z), 
n 

where B = PAP* and P is the orthogonal projection onto the orthogonal comple- 
ment v 1 - in C™ (so that B is acting in v- 1 ). A simple computation shows that any 
vector v = (vi,..., v n ) € C™ satisfying \vk\ — — != is a differentiator, see for the 
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proof the above section devoted to Gauss-Lucas matrices. Moreover, there is an or- 
thonormal basis of differentiators v^, vW, . . . , v'™ -1 ), where = (v[ l \ . . . , v$) 
and 

u<° = -L e 2 ™ M /". (6.3) 
V n 

We shall return to this basis later. 

In the above notations, if v is a differentiator and Q = I — P, then PAQ is 
a rank one operator 

PAQ : (x k )i4k4n !-> Xk Vk J ' (( z fc - £ ; )wfc)i^ fe ^„ (6.4) 
with the norm 

IIPAQH = a 2 (F) 

which gives a natural operator-theoretical interpretation of the variance. 
The conjectured inequality 

H(F,F') ^a 2 (F) 

reduces to two independent assertions: 

Statement (i): For any Z£ £ V(F), the disk D(z£, cr^F)) (centered at zt and 
having radius o~2(V(F)) ) contains at least one point from V(F'). 

Statement (ii): For any Wk G V(F'), the disk D(wk,o~2{F)) contains at least 
one point from V(F). 

We discuss next a matrix theory proof of Statement (ii), already settled in 
the previous section by a different method. The basic observation goes back to a 
paper by Ptak [SB] . Indeed, if we assume that w = is in V(F') and x G v 1 - is 
such that Bk = 0, then 

= PAx = Ax- QAx = Ax - QAPx, 

which implies 

\\A X \\ ^ \\QAP\\\\ X \\ = a 2 (F)\\ X \\ 

and hence A must have at least one eigenvalue z with \z\ ^ 02 (P) since A is a 
normal operator. 

The main difficulty in proving the 2-variance conjecture consists in proving 
Statement (i) . A standard reduction to the case where zi = transforms this ques- 
tion to the problem of proving the invertibility of a normal matrix under certain 
spectral conditions on its principal submatrices. Namely, let v^, v' 1 ), . . . , v*™ -1 ) 
be an orthonormal basis of differentiators and Pi denote the orthogonal projections 
to the hyperplanes v' 1 ' , and Qi = I — Pi. The proof of statement (i) then reduces 
to the following statement. 
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Conjecture 5. Let A be a normal matrix and let v^°\ v' 1 ' , . . . , v*-™ -1 ^ be an or- 
thonormal basis. Denote by P( the orthogonal projection onto v^-* . If all eigenvalues 
of the compressions Bg = PgAP^ lie outside the unit disk and \\(I — Pg)APi\\ ^ 1 
for all £, then A is invertible. 

This question is similar to known invertibility criteria for diagonally dominant 
matrices; this time we deal instead with codimention one principal submatrices 
rather than diagonal entries. 

The principal difficulty of this problem is that the matrices Bi are not normal 
and the challenge is to transform their known spectral properties into appropriate 
metric properties. In two special cases this can be done. Namely, we will give an 
alternate proof of Statement (i) (and hence the variance conjecture) in the case 
n = 3. Another special case is that of polynomials with real zeros. In this case, 
a much more stronger estimate than Statement (i) is valid (see Proposition 16.11 
below). 

6.1. A second proof of the 2-variance conjecture for n = 3 

A critical step towards the proof is contained in the next statement. 

Lemma 6.1. If B is an operator on C 2 such that all eigenvalues w of B satisfy 
\w\ ^ 1, then for any vector x £ C 2 

||Bxf + ||5*x|| 2 ^ ||x|| 2 . 

Proof. In an appropriate orthonormal basis B has the triangular form 

o _ ( 6ll &12 \ 

B -\ b 22 J' 

where &n and &22 are eigenvalues of B. The lemma follows now from the coordinate- 
wise computation of the left-hand side of the desired inequality. □ 

Remark 1. A similar inequality is no longer true in dimensions n ^ 3, even with 
a constant in front of ||x|| 2 on the right hand side. It is enough to consider 

I 1 -a a 2 
B = 1 -a 
\ 1 

and 

with a sufficiently large real a. 

In view of the discussion prior to Conjecture O the 2-variance conjecture in 
the case n = 3 is a consequence of the next result. 
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Lemma 6.2. Let A be a normal operator in C 3 and let Vi,V2, V3 be an orthonormal 
basis. For £ — 1,2, 3, let Pi be an orthogonal projection onto vj- and Qi = I — Pi- 
Assume that 

\\P e AQ e \\ 2 <1, £ = 1,2,3, 

and all operators Be = PeAPg acting in have their spectra outside the unit 
disk. Then A is invertible. 

Proof. Let e > be small enough, so that 

\\PtAQtW 2 < 1 - e, £=1,2,3. 

Let H denote C 3 and let Hi denote vj- for t = 1,2,3. The inequality 

HBiAxH 2 + H^Pixll 2 + ||P 2 P 2 x|| 2 + ||B 2 *P 2 x|| 2 + ||B 3 ^3x|| 2 + \\B;P 3 x\\ 2 > 

||P 1 x|| 2 + ||P 2 x|| 2 + ||P 3 x|| 2 = 2||x|| 2 

shows that the row operator 

1 — {±Si,±S 1 ,±S 2 , B 2 ,£S3 ,±S 3 ) , 

considered as acting from Hi © Hi ® H 2 © H 2 © H 3 © H 3 into H satisfies the 
estimate ||T*x|| 2 ^ 2||x|| 2 . 

The property that A is a normal operator implies 

\\PtA*Q t \\ = \\P t AQ t \\, £ = 1,2,3. 

Hence another row operator 

R = (QiA, QiA*,Q 2 A, Q 2 A* ,Q 3 A, Q 3 A*) 

(again acting from Hi® Hi® H 2 © H 2 © H 3 © H 3 into H ) satisfies the estimate 

||iTx|| 2 < (2 - 2e) (HQxxll 2 + ||Q 2 x|| 2 + ||Q 3 x|| 2 ) = (2 - 2 £ )||x|| 2 . 

This shows that T* + R* is bounded away from zero and hence T + R is surjective. 
Therefore, any x € H has the form 

x = Ay + A*z 

with appropriate y, z G if which implies that A is invertible since A is a normal 
operator. □ 

6.2. Polynomials with real zeros 

In the case of polynomials whose all zeros are real, a much stronger result, recently 
proved by Borcea, is valid. 

Theorem 6.1. 13 If all zeros Zi,...,z n of polynomial \6. 1\) are real, then for any 
zi G V(F) the disk D (zi, ) contains at least one zero of the derivative F' . 
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Proof. As before, we may assume that zg = and that u\ ^ 1 — e. We keep 
also notations from preceding proof. Assuming that all operators Bg do not have 

eigenvalues in the disk D (o, -j=fj , we shall see that A is invcrtible which gives 

a contradiction. 

Since all z\, . . . , z n are real, A is self-adjoint and hence all Bi are also self- 
adjoint. Therefore, 

||B /X || > -i=||x|| 
\Jn - 1 

for any x e which implies that 

n n 

^|| Bfe P fe x!| 2 ^— -^||P fe x|| 2 = ||x!| 2 . 
fe=i n 1 fc=i 

Hence the row operator 

T = (Bi,B 2 , . . . , B„) 
considered as acting from £Zi © i?2 © ■ ■ ■ © #n into 7? satishes the estimate 

||T*x|| > ||x|| 

for any x £ H. On the other hand, the row operator 

i? = (Qi A, . . . , Q n A) 

satisfies the estimate ||i?*x|| 2 (1 -e)||x|| 2 . Hence R* +T* is bounded away from 
zero which shows that R + T is surjective and hence A is invertible. □ 

Remark 2. The constant ^(-F 1 ) in Statement (ii) cannot be improved for polyno- 
mials with real zeros. A counterexample is F(z) = (z 2 — l) 2 and w = 0. 

6.3. Toeplitz matrix reformulation of the variance conjecture 

Statement (i) (and hence the 2-variance conjecture) is equivalent to the following 
clear matrix theory conjecture, originally identified by Borcea: 

Conjecture 6. Assume that n ^ 3, a\, . . . , a n _i G C and oq = — Y^k=i a k- Then 
the (n — 1) x (n — 1) Toeplitz matrix 

I a a x ... a„_ 2 \ 

dn-l a a l 

B = 

\ a 2 ••• a n _i a / 
has at least one eigenvalue A satisfying 

|A| 2 <X>| 2 - 

k=l 
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Indeed, the operator A in the orthonormal basis of differentiators V = 
{ v (°\ v^ 1 ), . . . , v^ 1-1 )} given by (|6.3[) has the circulant matrix 



A w = 



( a m ... a n _i ^ 
fln-i ao a i 



\ Oi 



a 2 



where 



1 ™ 



z k e 



ao J 

-2-rrikj /n 



fc=l 



so that the sequence (ao, . . . , a n -i) is the discrete Fourier transform of the sequence 
(zi, . . . , z n ). Assuming that z n = wc find the dependence ao = — J2k=i a k- Since 



\PqAQo , we see that 



k=l 



Finally the matrix B from Conjecture [5] is exactly the matrix of the operator 
PoAPq so that its eigenvalues are zeros of p'. 



7. Numerical range methods 

We recall the definition of the numerical range of a square matrix. Chapter 1 of 
[33] is a standard reference for this subject. 

Definition 7.1. Let A be an n x n matrix. The numerical range of A is the set 
W(A) = {x*Ax : x € C", ||x|| 2 = 1}. 

It can easily be seen that W(A) is a compact subset of the complex plane 
which contains the spectrum of A and that if B is a compression of A then W(B) C 
Toeplitz and Hausdorff independently have shown that W(A) is always 
convex. If A is normal, then W(A) is the convex hull of the spectrum of A. If 
A is a 2 by 2 matrix then W(A) is a (possibly degenerate) ellipse whose foci are 
the eigenvalues of A. To demonstrate the usefulness of numerical ranges in the 
analytic theory of polynomials, we will use them to give a short proof of Marden's 
Theorem. We recall that the Steiner inellipse of a triangle is the inscribed ellipse 
which touches each side of the triangle at its midpoint. 

Proposition 7.1. Let F(z) be a polynomial with three distinct zeros Zi, Z2, Z3. Then 
the critical points of F(z) are the foci of the Steiner inellipse of the triangle 
Aziz 2 z 3 . 

Proof. Let D be a three by three diagonal matrix with diagonal entries zi, z 2 , z%; 
then W(D) = A Zl z 2 z 3 . Let v = ^=(1, 1, 1) and let S = {x* Dx : x G C 3 , (x, v) 
0. ||x||2 = 1}. Since S is the numerical range of a two by two compression of D, 
S is an ellipse contained in Aziz 2 zs. Let w = -Wl,— 1,0). Since (w,v) = 0, 
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\{z\ + Z2) = w* Dw £ S. Similarly, 5(2:1 + 23), \{z2 + Z3) £ S and S is the Steiner 
inellipse of AZ1Z2Z3. Finally, as the projection onto v 1 is a differentiator of D, the 
foci of S are the critical points of F(z). □ 

The numerical range may also prove useful in attacking the following strength- 
ening of Sendov's conjecture due to Schmeisser |53) . 

Conjecture 7 (Schmeisser). Let F(z) be an nth degree polynomial with n ^ 2 and 
let £ be any complex number which is in the convex hull of the zeros of F(z). Then 
the closed disk centred at £ with radius c 00 (-F') contains a critical point of F. 

We note that (F) cannot be replaced by cr p (F) for any finite p in the above 
conjecture. (If n is sufficiently large so that ri -1 + 2n~p < 1, F(z) = z(z — l)' i_1 
will be a counterexample). 

Borcea has given a matrix theoretical generalization of Schmeisser's conjec- 
ture. Before stating it, we introduce the following notation. For annxn matrix A, 
we denote by S(A) its spectrum and by A% the (n— 1) x (n — 1) matrix obtained 
by removing the fcth row and fcth column of A. 

Conjecture 8. Let A be an n x n normal matrix. Then H(W(A),\J 1 ^_ 1 £(A/-)) ^ 
min c6C \\A - cl\\. 

Recall that for any n x n normal matrix A there is a set of differentiators 
of A which form an orthonormal basis of C™. If we only consider normal matrices 
for which every element of the standard basis is a differentiator then we recover 
Schmeisser's conjecture. 



8. Exclusion regions for the critical points 

So far we were interested in the shortest distance from a zero of a polynomial 
to its critical points. In the other direction quite a few results are known about 
exclusion of critical points from regions around the zeros of a polynomial. An 
ingenious combination of algebraic and geometric observations led J. von Sz. Nagy 
to the following result. 

Proposition 8.1. Let z\ be a zero of multiplicity m\ of the polynomial F(z) of 
degree n. Assume that the equation F(z) = admits at most s zeros on every side 
of a line passing through z\ and let K be a closed disk passing through z\ which 
does not contain any other zero of F. Then F'(z) ^ on the closed disk internally 
tangent to K , of diameter t ™^ s diam(K). 

Without reproducing here the complete proof (contained in |57)h we discuss 
only a beautiful geometric observation appearing in Sz. Nagy paper. Specifically, 
assume that wi is a critical point of F(z) = Y\l=i{ z ~ z k)i where the zeros zi, . . . ,z n 
are not necessarily distinct. If wi is not a zero of F, then 

F'(w e ) A 1 _ Q 
F(wi) k ~ 1 Zk ~ We - 
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Choose a real line L passing through W£ of slope e 1 ^ with respect to the positive 
real semi-axis, and write the polar decompositions of the denominators above with 
respect to the origin wg and L as zero-th direction: 

z k -w e = r k e i ^+^. 

By taking imaginary parts in the above identity we find 



E sm ip k _ 

fe=l K 

The diameter of the circle passing through z k and which is tangent to L at 

u>i is 

4 = 1 — 1, 

sm Lpk 

not excluding the value d k — oo if the point z k lies on L (that is sint^ = 0). Let's 
rearrange the zeros z k so that Zx,...,z q belong to one side of L and z n 
belong to the other side, so that 

< di < d-2 < ■ ■ ■ < d q , < d q+ i < d q+2 <••• < d n . 

Then we obtain the identity 

1 11 1 

di d q d q+ i dn 

From here Sz. Nagy obtains a series of non-trivial quantitative relations, of the 
form 

mi n-q m q+ i q_ 
d\ d q+ i ' d q+ i d\ ' 
where rrij is the multiplicity of the zero Zj . 

A consequence of the above proposition (and the preceding geometric reason- 
ing) is the following result, independently obtained by Alexander [I] and Walsh 
[59]. 

Theorem 8.1 (Alexander- Walsh). Let Z\ he a zero of multiplicity mi of the poly- 
nomial F(z) of degree n and let d denote the shortest distance from Z\ to another 
zero of F. Then F'(z) ^ for z belonging to the open disk centered at z\ of radius 
mi —. 

1 n 

We offer below an alternate matrix theory proof. For any m,p £ N with 
to ^ p, we define Q m ,p to be the set of all m-tuples of integers (3 = {fix, fa, /3 m ) 
satisfying 1 < /3j < (3 2 < ■ ■ ■ < (3 m -i < fim ^ p. If A is a p x p matrix and 
/3 € Qm,p, then |A[/3]| denotes the determinant of the to x to principal submatrix 
of A whose (i,j)th entry is the (/?i,/3j)th entry of A. We need the following well- 
known result: 

Lemma 8.1. [32] Let A be a p x p matrix and let X)fe=o a k zk be the characteristic 
polynomial of A. Then a p = 1 and a k — (— l) p ~ fc X^eg k \A[f3]\ for k > 0. 
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Let Ik denote the k x k identity matrix and let J k denote the k x k matrix 
all of whose entries are one. Using the previous lemma and some straightforward 
calculations one can prove the following: 

Corollary 8.1. Let D be a p x p diagonal matrix with characteristic polynomial G 
and let n > 0. Then the characteristic polynomial of (I p — ^Jp)D is (1 — £)G+^G'. 

We now prove Theorem O Let G(z) = z~ mi F(z + z x ). Then G(z) is an 
(n — mi)th degree polynomial with no zero in the open disk centered at the origin 
of radius d. Since F'(z+Z\) = z" 11 ^ 1 (niiG(z) + zG' (z)), we will be done if we show 
that m\G{z) + zG'(z) has no zeros in the open disk centered at the origin of radius 
mi — . Now let D be an (n — mi) x (n — m\) diagonal matrix whose characteristic 
polynomial is G(z). It is clear that ||-D _1 || = \- Let M = I n - mi — ~Jn-mi> then 
M is a positive definite matrix with one eigenvalue equal to ^ and all other 
eigenvalues equal to one. Hence ||M _1 || = The zeros of m x G{z) + zG'(z) 
are the eigenvalues of MD. Hence if z is a zero of m\G(z) + zG'(z), we have 
^ s: || (MD)- 1 ]] «; WM-'UD- 1 ]] = jJig and therefore \z] > m^. □ 

9. Cauchy transforms and the weighted variance conjecture 

The localization of zeros of the Cauchy transforms of positive measures is closely 
related to the similar problem for derivatives of polynomials. Indeed, given a poly- 
nomial (|6.1[) . we have 

F'(z) " 1 
F(z) ^z-z k 

and hence the zeros of F' can be interpreted as zeros of the Cauchy transform of 
the measure Y^,k—i ^ z k- 

In the case when the support of the measure is finite, the problem reduces to 
the question of localization of zeros of the Cauchy transform: 

n 
fc=l 

where /i = 52fc=i a kS Zk , with Z\, . . . ,z n € C and aj., . . . ,a n > 0. For simplicity, 
we may assume that J^ fc afc = 1 so that /i is a probability measure. As before, we 
may associate with the measure /x its baricenter 



and the variances 



E(fi) := ^ a k z k 
fe=i 



= ^c^-EM , l<p<co. 



\fe=i 
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At this point there are several possibilities to bring operator theory into the 
play. The first one is to consider the same normal operator A as before, i.e. the 
diagonal operator with zi, . . . ,z n on the diagonal. If the vector v € C" is defined 
as v = ^aj^ 2 , . . . , «r/ 2 ^ and P is the orthogonal projection to v- 1 , then one can 

easily check that the eigenvalues of the operator B = PAP* acting in v arc 
exactly the zeros of the Cauchy transform (19. lj) . Indeed, 



£ — = «A - *)~\ v) = d * t(B - Z l (9.2) 
j^[Zk-z u ' ' det(A-z) v ' 

Moreover, one has 

E(ri = (Av,v) 

and 

a 2 (ri = \\PA(I-P)\\ = \\(I-P)AP\\. 

With respect to the orthogonal decomposition C™ = {v} v , the operator A 
takes the form 

a - ( r % ) . (,3, 

where c.de v 1 - and ||c|| = ||d|| = <72(/i). By analogy with the invertibility crite- 
rion for diagonally dominant matrices we are led to the following statement: 



Claim 1. If \E(/i)\ > 02 (/u) and all eigenvalues w of B satisfy \w\ > 02 (/Lt), then A 
is invertible. 

If the above statement were true, then the following statement about the zero 
location of Cauchy transforms (|9.1|) would also be true: 

Claim 2. Let S(p) = {zi, . . . , z n } and let W e (fi) denote the set which is the union 
of zeros of the Cauchy transform i9. 1\) and the point E{fi). Then 

We will see shortly that the above Claim is false for n > 2. A natural attempt 
to save it would be to restrict both sets appearing in the inequality to some relevant 
subsets. First we adopt some general notation and restrict W e . For a compact 
subset K C C, let (Too(K) denote the Chebyshev radius of K, i.e. the radius of 
the smallest closed disk containing K. If fi is a probability measure, we denote by 
S(n) the closed support of \i and we put a oo (p) = cr 00 (S'(/i)), and W e {n) — {z e 
C\ C^(z) = {), z£S{n)}\J{E{ix)}. 

Claim 3. H(S(fi),W e (fi)) < a^). 

A three-point mass example shows that Claim (j3]) fails and hence so do 
Claims (JT] ) and ([2] ). Namely, the polynomial F(z) = (z — l){z 2 + 1)™ has the 
derivative F'(z) = (z 2 + l) ,l_1 [(2n + l)z 2 — 2nz + 1] and the associated measure 
I 1 = 27t+T Sl + Wfi S i + 2^TT S - 1 - Tnus ' Vocip) = 1, S(fi) = {l,i,-i} and W e (/x) = 
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"f 2nTT ± ^In+T 1 } U { 2^fl}- If «> 3, all points of W e (/i) are real and therefore 
the distance from ±i to W e (n) is strictly greater than 1 = a co (fi). This example 
also disproves a conjecture raised in [35] , 

Finally, we restrict by defining for a finite point mass measure /j,: 

s miB W = {Ce%); MIO) < MM), >«£%)}■ 

We are led to formulate another statement which still implies Sendov's conjecture. 
This time VQi) = {z <= C; C M (z) = 0, z ^ 

Conjecture 9. For a finite point mass probability measure /i, 

h(S m in(n),V((i)) < o-oo(n). 

Nattapong Bosuwan of Vanderbilt University has shown this conjecture holds 
in the case when the set S m i n (p) lies on the circle \z — c\ = a ao (fi), where c is 
the Chebyshev center of S(/i) as well as in the cases when S(fi) consists either of 
three point masses lying anywhere on the complex plane or an arbitrary number 
of collinear point masses. His proof of the first case is a generalization of the proof 
from 29J that Sendov's conjecture holds for zeros on the unit circle. We present 
Bosuwan's proof for the case where S(n) consists of three point masses. 

Proposition 9.1. [15] If /j, = mkS Zk , then Conjecture^ is true (in fact with 

o"oo(m) replaced by ct 2 (m))- 

Proof. Without loss of generality we assume that mi = 1, and 7712,7713 > 1 and 
proceed as in the proof of Theorem 13.21 Define vi = ( \/™>i V™3 \ 

V2 = ( , ^=4 , , , , , , ; ) and t; 3 = (Jmi, Jrn^, y/mi) where wi,W2 are 

the elements of V(fi). 

{v%, t>2, U3} are mutually orthogonal and {vi,V2,Va} is an orthonomal basis 



of where v, t = Then 



Vi 

U = I v 2 



V3 

is an unitary matrix. By Lemma 3.1, <p{U) is doubly stochastic. Let G := <p(U). 
Then g, Lj = ^^fmliwf-z^ ■> i = 1 -> 2 and 93j = ^£ wnere L = m \ + m 2 + 777,3. 
Since G is doubly stochastic, X^iff-ti = "TT"- This implies that there exists io € 
{1,2} such that g lQ i > ±. Let E = E{p). 

Then from equation 13.11 with 77 replaced by L and i replaced by io we get 

Ko - Zl\< (T 2 (/i) < 1. □ 

The Chebyshev radius o"oo(/Lt) admits an operator-theoretical interpretation. 
Namely, it was proved by Bjorck and Thomee [4] that for any bounded normal 
operator A in a Hilbert space H 

VooWA)) = inf \\A-aI\\= sup (\\Av\\ 2 - \(Av,v)\ 2 ) 1/2 . 



II s 1 
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If A is the operator of multiplication by an independent variable in £ 2 (/i) for a 
compactly supported probability measure fi we have 

a oc (fj,) = sup (\\Av\\ 2 -\(A Vl v)\ 2 ) 1/2 = sup{a 2 {is) : v<n}. 

ll«Ki 

Since we are dealing with matrix interpretations, an alternative possibility is 
to define the zeros of the Cauchy transform (|9.1[) as a spectrum is to use rank one 
perturbations of the above operator A : (aij)i^z^ n H> (2;£i)i<Z<n- Given a € C, 
we introduce the rank-one operator T a as 

n 

fc=i 

One can check that the spectrum of the operator A+T a is the set {a, wi,..., iu„_i}, 
where 101, . . . , w n -i are zeros of (19. ip . In particular, if a = E(ji), then a(A + 
^E<n)) — W e ([x). Moreover, in this special case we have in addition the properties 
T, l( M )=0and \\T EM \\ = ( t 2 ( M ). 

Sadly, in the labyrinth of observations we have stated and partially disproved 
in this section, we do not yet know how to state a weighted variance conjecture for 
probability measures supported by a continuum rather than finitely many points. 

10. Concluding remarks 

10.1. An indefinite inner product condition which would imply the 2-variance 
conjecture 

As was noticed above, the main difficulty in proving Conjecture [5] consists in 
transforming known spectral properties of operators Bi to metric properties. As 
an alternative, one can search for those metric properties of Bi which would imply 
desired spectral properties. We describe one possible scenario. 

As usual, we make first the reduction to the case where zi = and cr| = 1 — e. 
Further, it is known from matrix inertia theory (see, e.g. |371 chapter 13]) that the 
following condition is sufficient for the existence of an eigenvalue of an operator 
B (acting in some Hilbert space H) inside the unit disk: there exists an indefinite 
inner product (•, •) in H such that 

• (£>x, Bx) ^ (x, x) for any x € H ; 

• (x ,x ) > for some x G H. 

Therefore, one way to attack the variance conjecture by operator theoretical 
methods might be trying to construct such an inner product. 

10.2. Maxwell's conjecture 

In connection with the discussion of the weighted variance conjecture in Section 
9, it seems natural to raise here a question of plausible extensions to higher di- 
mensions. Note that the rational function (|9.ip . or more generally, the Cauchy 
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transform 




where [i is any compactly supported probability measure in C, can be interpreted 
as the force field (i.e., the complex gradient J^) of the logarithmic potential 



In this context, the Gauss-Lucas Theorem (for probability measures (i) is 
obvious. The points where the gradient of the potential vanishes, i.e., the zeros 
of the Cauchy transform of the measure /i, all lie in the convex hull of the 
support of fi. (At any point zq outside of the convex hull there would be a non- 
trivial component of the force field C p "pushing" a probational positive charge 
placed at zq off to infinity.) 

In this form, the result extends word for word to higher dimensions if we re- 
place the logarithmic potential by the Coulomb (or, in the context of gravitational 
force, Newtonian) potential in R™ thus replacing L M by 



for n > 3. Some further generalizations of the Gauss-Lucas theorem to higher di- 
mensions can be found in [27], also cf. [20]. However, it turns out that for atomic 
measures \x even the estimate for the total number of critical points where the 
gradient of the potential vanishes, i.e., no force is present, is not known. A conjec- 
ture going back to Maxwell [UJ asserts that if the measure /i consists of N point 
charges (not necessarily all positive), the total number of isolated critical points 
of the potential does not exceed (N — l) 2 . Of course, for C fll which is a rational 
function in C of degree N with a zero of order N at oo, the total number of finite 
critical points equals (2N — 2) — (N — 1) = N — 1 as follows at once from the 
Fundamental Theorem of Algebra. 

Note that in R", n > 3, the critical points of the potential can form lines or 
curves as well, e.g., if the +1, —1 charges alternate at the vertices of a square, the 
line through the center of the square which is perpendicular to the plane of the 
square will consist entirely of critical points of the potential. Examples of bounded 
curves, like circles and ellipses, of critical points in the electrostatic fields generated 
by three charges can be found in [34], also see [36] which is more easily accessible 
for an English speaking reader. 

Maxwell simply asserted the claim without giving any justification for its 
validity. Only recently, in [26) . in a technically brilliant paper, the authors were 
able to prove that the number of isolated critical points of the potential is finite. 
Yet, even for 3 (!) charges their separate proof for the possible number of isolated 
critical points yields the estimate < 12 instead of < 4 asserted by Maxwell. A 
slight improvement of the estimate in |26] under an additional assumption that 
all n charges (in R 3 ) lie in the same plane can be found in [36]. Also, we refer 
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the reader to [26] for the history of the problem and a rather complete, although 
surprisingly short, list of references that exist up to this date. 

We formulate an analog of Conjecture [9] in higher dimensions. 

Conjecture 10. Let u be a probability measure supported on a finite subset ofW 1 . 
Then 

h(S min (^),V(n)) < o-^ifi), 
where V{[i) denotes the set of critical points of the Newtonian potential. 

10.3. Miller's local maxima 

Let n > 2, p € [1, oo] and let S p (n) denote the set of all monic nth degree polyno- 
mials F for which cr p (F) ^ 1. Since h(F,F') is continuous as a function of F on 
the compact set S p (n), h(F,F') has an absolute maximum on S p (n). Conjecture 
1 is equivalent to the statement that the polynomials F(z) = (z — c) n — to where 
\u)\ = 1 are absolute maxima for h(F,F') on S p (n). Conjecture 2 is equivalent to 
the statement that for p > 1, these polynomials are the only absolute maxima for 
h(F,F') on S p (n). 

In [43], Miller looked at the local maxima of h(F , F') on 6*00 (n). It was known 
that all polynomials of the form F(z) = {z— c) n — u where \w\ = 1 are local maxima 
of h(F, F') on Soo(n) [42] [58]. Miller found eight properties which together formed 
a sufficcnt condition for a polynomial to be a local maximum of h(F, F') on Sooin). 
He then found polynomials which possess all eight properties; these polynomials 
were local maxima but were not of the form (z — c) n — uj [43]. These unexpected 
local maxima are particularly useful as test cases and possible counterexamples for 
conjectures. The polynomial which disproved the conjecture in Section 3 that every 
column of a Gauss-Lucas matrix of nth degree polynomial has an element greater 
than or equal to ± was F(z) = z 19 -0.881444934z 18 +0.896690269z 17 -0.492806889 
which is a local maximum of h(F, F') on S' 00 (19) found by Miller in [43]. It would 
be very instructive and useful to study the local maxima of h{F,F') on S p (n) for 
finite p of which little is known. 
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